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3 <110> APPLICANT: Yoshinori Watanabe 

5 <120> TITLE OF INVENTION: Novel centromeric protein SHUGOSHIN 

7 <130> FILE REFERENCE: 4439-4043 
C--> 9 <140> CURRENT APPLICATION NUMBER: US/10/581 , 158 
C--> 9 <141> CURRENT FILING DATE: 2006-05-31 

9 <150> PRIOR APPLICATION NUMBER: JP2003 -401943 
10 <151> PRIOR FILING DATE: 2003-12-01 

12 <150> PRIOR APPLICATION NUMBER: JP2004 -279450 

13 <151> PRIOR FILING DATE: 2004-09-27 
15 <160> NUMBER OF SEQ ID NOS : 45 

17 <170> SOFTWARE: Patentln version 3.1 

19 <210> SEQ ID NO: 1 

20 <211> LENGTH: 960 

21 <212> TYPE: DNA 

22 <213> ORGANISM: yeast 

24 <400> SEQUENCE: 1 

25 atgaactttc aatttataaa ttcaaatata aacaatgaag ataaattgcc gatggagtcg 60 
27 ttgaaaaaga aatttttaaa acaaaatcgt gaaattataa aaataaatac tcagctttct 120 

2 9 ataaaaatta gagaatctga aaacgaaatt caagatttga tacaagaaaa tttcactttg 180 
31 aaaagttatt tggttaaact tgaagctcga tttcgcaatc aatctcaaac tgaggacttg 240 

3 3 ttaaaaaact tctttcctga gatacaaacc attcacaaaa agatttcaca agtgcaaagt 3 00 
35 ttactgaaga ttatagagaa aaagtgttca tcagatttcc tcgaagcgaa tgtaaaaagt 360 
37 caatttacaa cctgtgaaaa taaagattcg aaagaagatt atcagatttt gcataataaa 420 
39 cgcttggagt atgtatcatt taatgatgaa cttaaaagtc tcgaaacagg gcaaccattg 480 
41 tattgttttc aagatttcca aaaaaaagtc catggtcctc cggctctatc tgaaaaacct 540 
43 ggaaaatgta tattaaaaga taaaaccaat gcccacgtaa acaaaatacc acaagatgag 600 
45 gtgaattact cattgccgca aaaaaatatc accatctttt caaaggaatt aaaagaaaac 660 
47 gaatttgaat ccatcaacga gggcgaaact gaagaagaaa aggctaaaac atcaaatgtt 720 
49 tgtgtttgta ttccttgtaa aagtgctgaa cagataactg accttaaagg acaagcaacc 780 
51 ggagacagct ccccatgtga ttttgaagaa tctcaaccaa ggattaatgg acgtgaaaaa 840 
53 ctaagacgat cagtcaaagt gataaactat gcaataccca gtttgcgaac taaactacga 900 
55 cgagactttg acttaccatc tgatagaaaa cgcaaacgac atcccagagg caaagcataa 960 

58 <210> SEQ ID NO: 2 

59 <211> LENGTH: 319 

60 <212> TYPE: PRT 

61 <213> ORGANISM: yeast 
63 <400> SEQUENCE: 2 

65 Met Asn Phe Gin Phe lie Asn Ser Asn lie Asn Asn Glu Asp Lys Leu 

66 1 5 10 15 

69 Pro Met Glu Ser Leu Lys Lys Lys Phe Leu Lys Gin Asn Arg Glu lie 

70 20 25 30 

73 He Lys He Asn Thr Gin Leu Ser He Lys He Arg Glu Ser Glu Asn 

74 35 40 45 
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77 Glu lie Gin Asp Leu lie Gin Glu Asn Phe Thr Leu Lys Ser Tyr Leu 


78 50 








55 








60 








81 Val Lys Leu Glu Ala Arg Phe Arg Asn Gin Ser Gin Thr Glu Asp Leu 


82 65 








70 








75 








80 


85 Leu Lys Asn Phe Phe Pro Glu : 


He Gin Thr He His Lys Lys He Ser 


86 






85 








90 








95 


89 Gin Val Gin Ser Leu Leu Lys : 


He He Glu Lys Lys Cys Ser Ser Asp 


90 




100 








105 








110 




93 Phe Leu Glu Ala Asn Val Lys ! 


Ser Gin Phe Thr Thr Cys Glu Asn Lys 


94 


115 








120 








125 






97 Asp Ser Lys Glu Asp Tyr Gin : 


He Leu His Asn Lys Arg Leu Glu Tyr 


98 130 








135 








140 








101 Val 


Ser 


Phe 


Asn 


Asp 


Glu 


Leu 


Lys 


Ser 


Leu 


Glu 


Thr 


Gly 


Gin 


Pro Leu 


102 145 










150 










155 








160 


105 Tyr 


Cys 


Phe 


Gin 


Asp 


Phe 


Gin 


Lys 


Lys 


Val 


His 


Gly 


Pro 


Pro 


Ala Leu 


106 








165 










170 










175 


109 Ser 


Glu 


Lys 


Pro 


Gly 


Lys 


Cys 


He 


Leu 


Lys 


Asp 


Lys 


Thr 


Asn 


Ala His 


110 






180 










185 










190 




113 Val 


Asn 


Lys 


He 


Pro 


Gin 


Asp 


Glu 


Val 


Asn 


Tyr 


Ser 


Leu 


Pro 


Gin Lys 


114 




195 










200 










205 






117 Asn 


He 


Thr 


He 


Phe 


Ser 


Lys 


Glu 


Leu 


Lys 


Glu 


Asn 


Glu 


Phe 


Glu Ser 


118 


210 










215 










220 








121 lie 


Asn 


Glu 


Gly 


Glu 


Thr 


Glu 


Glu 


Glu 


Lys 


Ala 


Lys 


Thr 


Ser 


Asn Val 


122 225 










230 










235 








240 


125 Cys 


Val 


Cys 


He 


Pro 


Cys 


Lys 


Ser 


Ala 


Glu 


Gin 


He 


Thr 


Asp 


Leu Lys 


126 








245 










250 










255 


129 Gly 


Gin 


Ala 


Thr 


Gly 


Asp 


Ser 


Ser 


Pro 


Cys 


Asp 


Phe 


Glu 


Glu 


Ser Gin 


130 






260 










265 










270 




13 3 Pro 


Arg 


He 


Asn 


Gly 


Arg 


Glu 


Lys 


Leu 


Arg 


Arg 


Ser 


Val 


Lys 


Val He 


134 




275 










280 










285 






137 Asn 


Tyr 


Ala 


He 


Pro 


Ser 


Leu 


Arg 


Thr 


Lys 


Leu 


Arg 


Arg 


Asp 


Phe Asp 


138 


290 










295 










300 








141 Leu 


Pro 


Ser 


Asp 


Arg 


Lys 


Arg 


Lys 


Arg 


His 


Pro 


Arg 


Gly 


Lys 


Ala 


142 305 










310 










315 











145 <210> SEQ ID NO: 3 

146 <211> LENGTH: 1944 

147 <212> TYPE: DNA 

148 <213> ORGANISM: yeast 

150 <400> SEQUENCE: 3 

151 atgtcgaaag catctctttc 
153 tataaggaaa ttatacgaat 
155 gaaaatgaac ggttgctttc 
157 gagcaactcg aaaccgtgca 
159 cttaatcgat ttcatgaaga 
161 gaaatacaag acaccttcaa 
163 tctgaagacc ttgacgagga 
165 cagcatgatg tttccttacg 
167 ggagaaactg ctataaatga 
169 tcagcacctc agctaaaatc 



cccgaacgta 
aagcaaggca 
ggaaaatatc 
aaacgaaaac 
aacagataat 
accagtggag 
atccgtcgtg 
aaatttaagt 
acaaaaaaaa 
cggcaaagcc 



gaagacttga 
caatcaatta 
gatttgagga 
gaagaaaaca 
tttttatcaa 
gctaacttag 
aaagataccg 
ggaatagagg 
agagaagcta 
ctcccagctg 



aaaaaaagca 
gaattaaaga 
ctacagcgat 
aaacaaagtt 
aattaagtct 
cttacgatgt 
aagaaataat 
atgaaaatat 
atgttttttc 
attttgaaaa 



aattcgacag 
attgcagtta 
aaacttggaa 
agctgcatta 
ttgtcagcaa 
cgatacggat 
tgagcaagct 
aattgatgac 
cgacacgcaa 
tccttacaat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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171 ctatccaatt cgaaacctgt aaataataat aatgaagata gagttgaagc ggttacttct 660 

173 gaaaataaat ctatcgattc tgctcctcag gaaaaaaatc atgaatacga aatcgttagt 720 

175 ccaaaatcat tatccaacaa aattaataat caagcagctg cacaaagaag aaccgaagaa 780 

177 gataatgcaa atggagttgc tcaagaagaa aatgagggtt cacaagaagc tcattttcat 840 

179 agcagaatac aatctgatac agtaatacaa agtacaccca ctaaacggaa atgggacgtt 900 

181 gacattcaaa ataaacaaat taatctggct tctgcagcta ccaatgttac cggttatgta 960 

183 tcggagaccg atagtcgccc caatcgcgca aactctttgg attctgctgt ccttcttgtg 1020 

185 caatcttcaa ataaaagtaa ccgaaatggg catcatattt cagatcctaa tttaaatagc 1080 

187 tccatatcgt tgaagtttgc gcctgaagat actgcgcata attcattaac ttcacaagag 1140 

189 aatgttgggc ctcaggttac gacgacttct ctgtcaaata tgactgttgc tgaatctcct 1200 

191 cgtacagaca ctccaaggga aataaacggg ttagtagact cttctgtcac taatgggaac 1260 

193 gaaaaatttt ctgtagaaat aatgaatgac tctaacaaaa ttggactgaa tcctaaatct 1320 

195 tttaccgacg aagagcggga aattttaaca ctttttcgaa atcctcccat gagactgtca 1380 

197 agtgaacctc catcttcaaa tggattttca atagcccatc ccaataattc tccgttacgt 1440 

199 ccgccatcgc tacaaggaat attgaatgct gaagatcgac cttacgaaat tgagccgtca 1500 

201 cgtagctcct ttgctaccaa cgatacgggc tcctataata atttggaact tctgtcatct 1560 

2 03 gtaacgaatt tgaaatcccc taatgagaac gatcgtgtga cgaaaactca gtcgcgaaga 162 0 

205 gaaacaaaag tgaaaaggcg aagaaaagct cggattcaag aaacttctga agaaagtaca 1680 

2 07 gtagtcaatg agccaaatga aaaacctgat ggaaggagcc gaagggaacg gaaaaaggtt 1740 

209 aattacgctt tgcctggatt aaggacgaaa ttaagacgga atttcgattt accttcagat 1800 

211 catgtaaaag ctaaaaaaac gagacgtgct cctaagaact ctgagaatga ttcagctacc 1860 

213 aaaacagaaa ccgcaaacat tacttctgaa gcacccacta cttcagaagt aacccttgaa 192 0 

215 aactccgaaa cccttaattt gtaa 1944 

218 <210> SEQ ID NO: 4 

219 <211> LENGTH: 647 

220 <212> TYPE: PRT 

221 <213> ORGANISM: yeast 
223 <400> SEQUENCE: 4 

225 Met Ser Lys Ala Ser Leu 

226 1 5 

22 9 Gin lie Arg Gin Tyr Lys 
230 20 

233 lie Arg lie Lys Glu Leu 

234 35 

23 7 Asn lie Asp Leu Arg Thr 
238 50 

241 Thr Val Gin Asn Glu Asn 

242 65 70 

245 Leu Asn Arg Phe His Glu 

246 85 

249 Leu Cys Gin Gin Glu lie 

250 100 

253 Leu Ala Tyr Asp Val Asp 

254 115 

257 Val Val Lys Asp Thr Glu 

258 130 

261 Ser Leu Arg Asn Leu Ser 

262 145 150 
265 Gly Glu Thr Ala He Asn 



Ser Pro Asn Val Glu 
10 

Glu He He Arg He 
25 

Gin Leu Glu Asn Glu 
40 

Thr Ala He Asn Leu 
55 

Glu Glu Asn Lys Thr 
75 

Glu Thr Asp Asn Phe 
90 

Gin Asp Thr Phe Lys 
105 

Thr Asp Ser Glu Asp 
120 

Glu He He Glu Gin 
135 

Gly He Glu Asp Glu 
155 

Glu Gin Lys Lys Arg 



Asp Leu Lys Lys Lys 
15 

Ser Lys Ala Gin Ser 
30 

Arg Leu Leu Ser Glu 
45 

Glu Glu Gin Leu Glu 
60 

Lys Leu Ala Ala Leu 
80 

Leu Ser Lys Leu Ser 
95 

Pro Val Glu Ala Asn 
110 

Leu Asp Glu Glu Ser 
125 

Ala Gin His Asp Val 
140 

Asn He He Asp Asp 
160 

Glu Ala Asn Val Phe 
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966 

Z, O O 








165 








170 










175 




9 6 Q 

O Z> 


Ser Asp 


Thr Gin 


Ser Ala 


Pro 


Gin 


Leu 


Lys 


Ser 


Glv 


Lvs 
1 


Ala 


Leu 


Pro 


9 70 






180 








185 










190 






9 71 


Ala Asp 


Phe Glu 


Asn Pro 


Tvr 

JL 


Asn 


Leu 


Ser 


Asn 


Ser 


LVS 
1 


Pro 


Val 


Asn 


Z / ft 






195 






200 










205 








9 7 7 

Z f / 


Asn 


Asn 


Asn Glu Asp Arg 


Val 


Glu 


Ala 


Val 


Thr 


Ser 


Glu 


Asn 


Lys 


Ser 


9 7ft 




210 






215 










220 










9 ftl 
o X 


He 


Asp 


Ser Ala 


Pro Gin 


Glu 


Lys 


Asn 


His 


Glu 


Tvr 


Glu 


He 


Val 


Ser 


9 R9 


225 






230 










235 










240 


9 ft R 
ZOO 


Pro 


Lys 


Ser Leu 


Ser Asn 


Lys 


He 


Asn 


Asn 


Gin 


Ala 


Ala 


Ala 


Gin 


Arg 


zoo 








245 








9^0 










255 




9 ft Q 
Z O 


Arg 


Thr 


Glu Glu Asp Asn 


Ala 


Asn 


Gly 


Val 


Ala 


Gin 


Glu 


Glu 


Asn 


Glu 


z y u 






260 








96^ 

^ u J 










270 






Z7J 


Gly Ser 


Gin Glu 


Ala His 






Cpr 
OC1 




He 


Gin 


Ser 


Asp 


Thr 


Val 


z y4 






275 






9 ft n 

z 0 u 










285 








z y / 


He 


Gin 


Ser Thr 


Pro Thr 


Lys 


Arg 








Val 


Asp 


He 


Gin 


Asn 


z y o 




290 






Z -7 O 










300 










i ni 


Lys 


Gin 


He Asn 


Leu Ala 


Cpy 
OC1 


Ala 


Ala 


Thr 


Asn 


Val 


Thr 


Glv 


Tvr 

JL 


Val 


J Uz 


305 






310 










11 R 










320 


iUj 


Ser 


Glu 


Thr Asp 


Ser Arg 








Ala 


r\CD JL J. 


Ser 


Leu 


Asp 


Ser 


Ala 










325 








1 1 n 










1 m 




1 OQ 

J \J w? 


Val 


Leu 


Leu Val 


Gin Ser 




Asn 


Lys 


Ser 


Asn 


Arg 


Asn 


Glv 


His 


His 


t i n 
jIU 






340 








14 ^ 










350 






on 
jij 


He 


Ser 


Asp Pro 


Asn Leu 




Ser 


Ser 


He 


Ser 


Leu 


Lys 


Phe 


Ala 


Pro 


j «L *± 






355 






160 

J D U 










365 








11 7 


Glu Asp 


Thr Ala 


His Asn 




Leu 


Thr 


Ser 


Gin 


Glu 


Asn 


Val 


Glv 


Pro 






370 






17R 










380 










191 


Gin 


Val 


Thr Thr 


Thr Ser 


Leu 


Ser 


Asn 


Met 


Thr 


Val 


Ala 


Glu 


Ser 


Pro 


199 


385 






390 










395 










400 


19R 
j z. ~> 


Arg 


Thr 


Asp Thr 


Pro Arg 


Glu 


He 


Asn 


Glv 


Leu 


Val 


Asp 


Ser 


Ser 


Val 


19 6 
-J z, o 








405 








410 










415 




19 Q 


Thr 


Asn 


Gly Asn Glu Lys 


Phe 


Ser 


Val 


Glu 


He 


Met 


Asn 


Asp 


Ser 


Asn 








420 








425 










430 






333 


Lys 


He 


Gly Leu Asn Pro 


Lvs 


Ser 


Phe 


Thr 


Asp 


Glu 


Glu 


Arg 


Glu 


He 


114. 






435 






440 










445 








337 


Leu 


Thr 


Leu Phe 


Arg Asn 


Pro 


Pro 


Met 


Arg 


Leu 


Ser 


Ser 


Glu 


Pro 


Pro 


lift 

J J o 




450 






455 










460 










341 


Ser 


Ser 


Asn Gly 


Phe Ser 


He 


Ala 


His 


Pro 


Asn 


Asn 


Ser 


Pro 


Leu 


Arg 


342 


465 






470 










475 










480 


345 


Pro 


Pro 


Ser Leu Gin Gly 


He 


Leu 


Asn 


Ala 


Glu 


Asp 


Arg 


Pro 


Tyr 


Glu 


346 








485 








490 










495 




349 


He 


Glu 


Pro Ser 


Arg Ser 


Ser 


Phe 


Ala 


Thr 


Asn 


Asp 


Thr 


Gly 


Ser 


Tyr 


350 






500 








505 










510 






353 


Asn 


Asn 


Leu Glu 


Leu Leu 


Ser 


Ser 


Val 


Thr 


Asn 


Leu 


Lys 


Ser 


Pro 


Asn 


354 






515 






520 










525 








357 


Glu 


Asn 


Asp Arg 


Val Thr 


Lys 


Thr 


Gin 


Ser 


Arg 


Arg 


Glu 


Thr 


Lys 


Val 


358 




530 






535 










540 










361 


Lys Arg 


Arg Arg Lys Ala 


Arg 


He 


Gin 


Glu 


Thr 


Ser 


Glu 


Glu 


Ser 


Thr 


362 


545 






550 










555 










560 
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365 Val Val Asn Glu Pro Asn Glu Lys Pro Asp Gly Arg Ser Arg Arg Glu 

366 565 570 575 

369 Arg Lys Lys Val Asn Tyr Ala Leu Pro Gly Leu Arg Thr Lys Leu Arg 

370 580 585 590 

373 Arg Asn Phe Asp Leu Pro Ser Asp His Val Lys Ala Lys Lys Thr Arg 

374 595 600 605 

377 Arg Ala Pro Lys Asn Ser Glu Asn Asp Ser Ala Thr Lys Thr Glu Thr 

378 610 615 620 

381 Ala Asn lie Thr Ser Glu Ala Pro Thr Thr Ser Glu Val Thr Leu Glu 

382 625 630 635 640 

385 Asn Ser Glu Thr Leu Asn Leu 

386 645 

389 <210> SEQ ID NO: 5 

390 <211> LENGTH: 1773 
3 91 <212> TYPE: DNA 

392 <213> ORGANISM: yeast 
394 <400> SEQUENCE: 5 

3 95 atgccgaaga gaaaaattgc tcctaacaag gaaagcagca ggcgtacggt ctcccacgat 60 

3 97 gatttaaccc cacaaataca agaatttcaa aacctaatgg atctcgaatc gcaaaaagtg 120 

399 gaaaacatca gacagtcgta ttcgaggcaa aactccctgc tggccaagga taactccata 180 

401 ttaaaaatta aagttaatag cttggaaaaa aaaataagcc agctggtaca agaaaacgtg 240 

403 actctacgat ctaaaacctc tataagcgaa gctatctaca gggaacggtt aagtaatcaa 300 

405 ctacaagtca ttgaaaacgg tattattcaa agatttgacg aaatttttta tatgtttgag 360 

407 aacgtacgta aaaacgaaaa tttgcccagt tcgagcttaa gaacaatgtt gaagagaacg 420 

409 agttccaggt caagatcatg ctcattgtca tcacccacat actcaaaaag ttacactagg 480 

411 ttatcaaatc acgagaataa cctgtcgcat gaatcaagtt ttaacaagga cgatggtcca 540 

413 gatcttgagc ctaaggctaa aaaaaggaag agttctaggc ggcaatctat gtttgtatcc 600 

415 acgagtttag aacctgaaga cgaaaccggt gaaaacgaac ccatgatgga aaattcctct 660 

417 gtagaggtac cggcagaatc acacgagtct gcgcaagtgg aggaaacaat agatgcctta 720 

419 aaccctgaag aggaaaatag cgattctgtc agtaatttta ccaattcaat tatagaatac 780 

421 tccataccag aggagaatcc gacagaaccc gagcattcat cttctaaact agaaatattc 840 

423 aatgacagta caaatatgct aagtacagtg ccgtcaaatc ctttgccgtt gcctttacca 900 

425 ggcccatccg caactttacc tactaccact agcgatgctt caacggtcta tccttcatca 960 

427 agttcttcta ctaattctca tccaaagacc aaaattaagc attccatgaa gccgcctagg 1020 

429 atagaactga agaaaaaggt tattgacgaa gtcatgcccg taagtaacat gagcagcaac 1080 

431 agcgaaatat catttacgag aactagaaga actcgtggta aagctgtaga ttacactttg 1140 

433 ccttctttaa gagccaaaat gaggaggcct tcagaaaaac ttgtggatgc tactactgtg 1200 

435 attgatatac atgatctaca ggtttccaag agaaatcggg aaacttcaca taaaaggaaa 1260 

437 agtttatccc aagattcaat acccgacgaa ccgcaattga gagaagtcgt cgtctcaaag 1320 

439 gattatggaa ctccaaaagg gaaaaaaacg gaagatgaaa tacacgagga taccgctcat 1380 

441 ctaatgacca cttccaacaa caacagcaac aacaaaaacg aaaaaaaact aactagcaac 1440 

443 aatagcccta aaaaatcgtc gcctttactt gacattacaa ataaatcgga gaataagaaa 1500 

445 aagtcaacaa gaactaaaaa attgttcaaa aatgcaattg tcaataattt atctgatgaa 1560 

447 aattctacta cgcgaccctc caagtcgtca aagggaacca gtaataataa caacaattac 1620 

449 aacaatttcg acaataacaa ttcaaacatt aataatgtta ataataaatc tgttagcttt 1680 

451 agactaaatg aagatgattt agcagtattt gatttatttg gaaatggtaa ggcagtgaaa 1740 

453 catcaaccaa aaacatatcg caccaaaaaa tga 1773 

456 <210> SEQ ID NO: 6 

457 <211> LENGTH: 590 
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Invalid <213> Response: 

Use of "Artificial" only as "<213> Organism" response is incomplete, 

per 1.823(b) of New Sequence Rules. Valid response is Artificial Sequence. 

Seq#:40 
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